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Abstract. The Split algorithm is an essential building block for route-first cluster-second 
heuristics and modern genetic algorithms for vehicle routing problems. The recent survey of 
[Prins, Lacomme and Prodhon, Transport Res. C (40), 179-200] lists more than 70 articles that 
use this technique. In the vehicle routing literature, Split is usually assimilated to the search for 
a shortest path in a directed acyclic graph Q and solved in 0{nB ) using Bellman’s algorithm, 
where n is the number of delivery points and B is a bound on the number of customers per route, 
due to the capacity constraints. Some linear-time algorithms are also known for this problem as a 
consequence of a Monge property of Q. In this article, we highlight a stronger property of this 
graph, leading to a simpler alternative algorithm in 0(n). Experimentally, we observe that the 
approach is faster than the classical Split for problem instances of practical size. We also extend 
the method to deal with a limited fleet and soft capacity constraints. 

Keywords. Vehicle Routing Problem, Large Neighborhood Search, Split Algorithm, Cluster-First 
Route-Second Heuristic 


1 Introduction 

The algorithm of Prins (2004) was an important milestone for the vehicle routing problem (VRP): 
it was the first hybrid genetic algorithm with local search to outperform classical tabu searches 
at a time when such methods were predominant. One main ingredient of its success was its 
approach to solution representation and recombination. Until the 2000s, combining two solutions 
was considered a difficult task, because simple crossover operators had a tendency to produce 
infeasible and unbalanced routes. To meet this challenge, Prins (2004) represented the solution as a 
permutation of visits, a “giant tour”, and relied on a dynamic-programming-based decoder, called 
Split , which optimally inserts depot visits to obtain complete solutions. This makes it possible to 
efficient use classical crossovers for permutations, since the Split algorithm is in charge of route 
delimitations, and the capacity constraints are implicitly managed during solution decoding. 
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Ten years on, the literature on population-based methods for VRPs has grown extensively. 
Efficient Split-free GAs with more advanced crossover operators now exist (e.g., Nagata and 
Braysy 2009), a sign that the Split algorithm is useful but not a necessity. Nevertheless, the 
approach of Prins (2004) is simple and generic, and most modern GAs for the VRP are still based 
on giant-tour solution representations. The Split algorithm has also been generalized to address 
many constraints and decision sets via dynamic programming, leading to successful algorithms 
for a large number of problem variants, as surveyed in Duhamel et al. (2011), Vidal et al. (2012), 
Prins et al. (2014), and Laporte et al. (2014). 

The computational efficiency of the Split algorithm is the main focus of this article. Consider 
a giant tour (1,..., n) containing n customers with demands gi,..., q n E [0, Q], and a set of 
vehicles of capacity Q located at a depot. Let be the distances between two successive 

customers, and cfoi and d \o be the distances from and to the depot. Classically, the Split algorithm 
is assimilated to the search for a shortest path between 0 and n in the acyclic graph Q — (V,M), 
where V = (0,... ,n) and where each arc (i, j) E A for i < j corresponds to a feasible route 
starting at the depot, visiting customers i + 1 to j, and returning to the depot (Beasley 1983, 
Prins 2004). In the vehicle literature, such a shortest path is usually obtained in 0{nB ) via 
a variant of Bellman’s algorithm, where B is a bound on the number of deliveries per route 
as a consequence of the capacity constraints. Moreover, for a limited fleet of m vehicles, the 
propagation of the labels is iterated to produce a shortest path with fewer than m edges in 
0(nmB). Such complexity is suitable for most medium-scale applications. However, Split can 
become a computational bottleneck for very large problems with few routes, when used iteratively 
in a metaheuristic. 

To meet this challenge, we will introduce a new Split algorithm in 0{n). Note that some 
linear-time algorithms are already known for this shortest path (see Burkard et al. 1996, Bein 
et al. 2005, and the references therein) as the graph Q satisfies the Monge property: 

c(hji) + c(i 2 , J 2 ) < c(ii,j 2 ) + c(i 2 , ji) for all 0 < h < i 2 < ji < h < n, (1) 

where c(z, j ) is the cost of an arc (z, j ). So far, these methods were not applied in the VRP literature. 

In this article, we propose an even simpler alternative, based on the fact that the auxiliary 
graph Q satisfies the following stronger property: 

for all 0 < i\ < i<i < n, there exists K E M such that 
c(ii, j) — c(i 2 , j) — K for all j > £2 such that (A, j) E A. 

We show that Property (2) can be used to eliminate dominated predecessors and retain only good 
candidates, leading to a very simple labeling algorithm in 0{n) which performs well in practice 
and can be efficiently used in VRP metaheuristics. The approach is also extended to produce a 
solution of the Split problem with a limited number of vehicles in 0(nm), and with soft capacity 
constraints in 0(n). 

Finally, we compare the practical CPU time of the proposed method with that of the classical 
Bellman-based algorithm, using giant tours built from TSP instances. These instances contain 
from n = 29 to 71,009 nodes, and the number of deliveries per route ranges from 4 to 4, 000. The 
linear approach appears to be faster in most cases, with speedups ranging from 0.8 x to 400 x. 
The largest speedups are achieved for instances with many deliveries per route, which can occur 
in courier delivery, refuse collection, and meter reading applications. 
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The remainder of this paper recalls the Bellman-based Split algorithm in Section 2, introduces 
the proposed linear Split in Section 3, discusses its generalization to limited fleets and soft capacity 
constraints in Section 4, and reports our computational experiments in Section 5. To facilitate the 
use of these algorithms in future generations of heuristics, a C++ implementation of the methods 
of this paper is available at http://wl.cirrelt.ca/~vidalt/en/VRP-resources.html. 

2 Bellman-based Split Algorithm 

Split is traditionally based on a simple dynamic programming algorithm, which enumerates the 
nodes in topological order and, for each node £, propagates its label to all successors i such that 
(£, z) E A. The presentation in Algorithm 1 is similar to that of Prins (2004). The edge costs are 
not preprocessed but directly computed in the inner loop. This specific algorithm was used as a 
benchmark in our computational experiments in Section 5. 


Algorithm 1: Classical Split Algorithm 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 


p[0] <— 0 

for t — 1 to n do 

[ p[t] <— oo 
for t = 0 to n — 1 do 
load E- 0 
i i — t + 1 

while i < n and load < Q do 
load E- load + qi 
if i = t + 1 then 
| distance E- 
else 

j distance E- distance + <A-i^ 
if p[t\ + cost + dio < p[i] then 
p[i] = p[t] + cost + <Ao 
pred[i] — t 
i i + 1 


At the end of each iteration t (lines 5-16 of Algorithm 1), p[t\ contains the cost of a shortest 
path from 0 to t. The array of predecessors pred is maintained throughout the search so that we 
can retrieve the solution at the end of the algorithm. 

3 Split in Linear Time 

This section will introduce a more efficient Split algorithm. We first describe some auxiliary data 
structures and the data for a numerical example. 
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Preliminaries. We define for i G {1, ..., n} the cumulative distance D[i\ and cumulative load 
Q[i\ as follows: 


i—l 

D[i\ = dk,k +1 

(3) 

k =1 


i 

Q[i\= 

k =1 

(4) 


These values can be preprocessed and stored in 0{n) at the beginning of the algorithm. For 
i < j , the cost c(i, j) of an arc (i, j) is the cost of an edge leaving the depot, visiting customers 
(i + 1,..., j), and returning to the depot, computed as 

c (hj) = ^o,z+i + D\j] — D[i + 1] + (5) 

and the arc (i, j) exists if and only if the route is feasible, i.e., Q\j] — Q[i\ < Q. 

Our algorithm also relies on a double-ended queue, denoted A, that supports the following 
operations in 0(1): 

front - accesses the oldest element in the queue; 
front2 - accesses the second-oldest element in the queue; 
back - accesses the most recent element in the queue; 
push-back - adds an element to the queue; 
popjront - removes the oldest element in the queue; 
popJback - removes the newest element in the queue. 

We will refer to the elements of the queue as (Ai,..., A|a|), from the front Ai to the back A|a|- 

Data for the Numerical Example. To illustrate the algorithm, we use a numerical example 
with 12 nodes. Figure 1 provides the input distances from the depot and between successive nodes 
as well as the demands associated with each node. For this instance, the best solution consists of 
the routes (0,1,2,3,4,0), (0,5,6,7,8,9,0), and (0,10,11,12,0). 



Main algorithm. For t G {1,... ,n}, our algorithm iteratively finds a best predecessor for t and 
sets the cost of a shortest path from 0 to t (line 4). Instead of iterating over the predecessors 
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and storing the minimum cost in 0(n), as in Bellman’s algorithm, it takes advantage of the cost 
structure of the Split graph and stores nondominated predecessors in the queue A (lines 7-12) in 
order to find a best predecessor in 0(1). 


Algorithm 2: Linear Split 


1 p[0] <— 0 

2 A (0) 

3 for t — 1 to n do 


4 

5 

6 

7 
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10 

11 

12 


p[t] <— p[front\ + f (front, t ) 
pred[t\ <— front 

if t < n then 

if not dominates(back,t ) then 

while |A| > 0 and dominates(t, back) do 
j popBackQ 
pushBack(t) 

while Q[t + 1] > Q + Q[front] do 
| popFrontQ 


The definition of the boolean function dominates(i, j) completes the algorithm: 


dominates(i, j) 


(p\i\ + d 0 ,i+i -D[i + 1] < p[j] + do,j+i ~ D\j + 1] and Q[i\ = Q\j] if i < j 
1 p[i\ + do, i+i - D[i + 1] < p[j] + d 0 ,j+i - D\j + 1] if i > j. 


Correctness of the algorithm. Define f(i,x) as the cost achieved when extending the label 
of a predecessor i to a node x G {i + 1, ..., n}. This function takes an infinite value if the arc 
(i,x) ^ A because of capacity constraints: 


f(i,x) 


p[i] + c(i, x) Q[x\ - Q[i\ < Q 

oo otherwise. 


Furthermore, define the auxiliary function gi(x) = f(i,x) — D[x] — d x q. This function of x 
takes a constant value as long as the label extension is feasible, since 

if Q[x\ - Q[i\ < Q, gi(x) = p[i] + d 0 ,;+i + D[x\ - D[i + 1] + d x0 - D(x) - d x0 

= p[i\ + do,i+i — D[i + 1]. 

Figure 2 displays the functions gi(x) of the numerical example at iteration t — 7. 
consider two candidate predecessors i and j such that i < j <t. The expression f(i,x) — f(j,x) 
represents the cost difference between extending the predecessor i to a node x > t and extending 
j to the same node. Note that f(i,x) — f(j,x) — gi(x) — gj(x). Therefore, each predecessor i < t 
can be characterized by: 

— a fixed cost gi — p[i\ + do,i+i — D[i + 1], 

— and the cumulative demand li = Q[i\. 


(7) 

Now, 
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Figure 2: Numerical example - The functions gi{x) are displayed for i G {1,..., 6}. Note that 
the function go(x) does not appear (takes an infinite value) since (0,7) ^ A. 

These two values lead to the dominance relation of Equation (6): if gi > gj and U > lj , then 
gi(x) > gj(x) for any x > t and candidate j is always at least as good as i. 

Algorithm 2 has been designed to satisfy the following invariant at line 4: 

Proposition 1 (Loop Invariant) A contains mutually nondominated predecessors oft ranked 
by increasing cost: for i G {1,..., |A| - 1}, g Xi < g\ i+ 1 and l x . < l Xi + i- Moreover, Q[t} <Q + l Xl . 

This proposition is true for t = 1 since A = (0) and q\ < Q. Then, any new node t will be added 
to A only if it is not dominated by the current back element (lines 7 and 10). If t is added, the 
loop at lines 8-9 removes from the queue any node that is dominated by t, starting from the back 
of the queue and stopping as soon as the first nondominated element is found. Since the elements 
are ordered by increasing cost, this guarantees the removal of all predecessors dominated by t. 
Finally, when the index t is incremented, any predecessor at the front of the queue that cannot 
be feasibly extended to t + 1 is eliminated to ensure that Q[t + 1] < Q + l Ai- 

As a consequence of this invariant, the first element of the queue is always a best predecessor 
for t. Indeed, it is a feasible predecessor, and all other predecessors in the queue have a greater 
cost. Furthermore, any other element that is no longer in A was either dominated or could not be 
feasibly extended to a node x > t. This proves the correctness of the algorithm. 

In terms of complexity, we remark that each node i cannot be added to the queue more than 
once via pushBack or deleted more than once via popBack or popFront. As a consequence, the 
operations of lines 7-12 are performed at most n times, leading to an overall complexity of 0(n). 

4 Extensions of the Linear Split Algorithm 

This section describes two extensions of the proposed algorithm, for the Split problem in the 
presence of a limited number of vehicles, and for the case where linear penalties are imposed if 
the capacity is exceeded. 
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4.1 Limited Number of Vehicles 

The extension of the algorithm to a limited number of vehicles requires us to perform the previous 
algorithm once for each vehicle. The resulting approach is described in Algorithm 3. 


Algorithm 3: Linear Split: Fleet limited to m vehicles 


l for k — 1 to m do 


for t = 0 to n do 
j p[k, t\ = oo 
0 


2 

3 

4 p 

5 for k = 0 to m — 1 do 


0 , 0 ] 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 
17 


clear(A) 

A <- 0fc) 

for t = k + 1 to n s.t. |A| > 0 do 

p[k + 1, t\ <— p[k, front] + f (front, t ) 
pred[k + 1 ][t\ <— front 

if t < n then 

if not dominates(k, back, t) then 

while |A| > 0 and dominates(k,t, back) do 
| popBackQ 
pushBack(t ) 

while |A| > 0 and Q[t + 1] > Q + Q[front\ do 
| popFrontQ 


For k G {1,... m} and t G {k ,..., n}, the two-dimensional array p[k, t] will contain the cost 
of a shortest path with k edges finishing at t. These costs are computed for increasing k in an 
outer loop (line 5) and for increasing t in the inner loop (line 8). The cost of any label p[k + 1 ,t\ 
is obtained from the extension of a best predecessor p[k,i\ with i < t. Applying the Bellman 
algorithm in the inner loop would lead to a complexity of 0(n 2 m). Instead, we use the queue 
data structure and dominance properties as in the previous section, leading to a complexity of 
0(n) in the inner loop, for a total complexity of 0(nm). 

The inner loop can be stopped once A is empty (line 8). In this state, the algorithm has 
reached the last index that can be feasibly attained with k routes. The minimum cost of a route 
containing m vehicles is given at the end of the algorithm by p[m,n], and the two-dimensional 
array pred enables us to trace back the solution. The minimum cost of a route containing k < m 
vehicles can also be found, by seeking the minimum of p[k, n], for k G {1, ..., m}. Note that the 
dominates function takes the number of vehicles k as an extra argument—since it considers the 
two-dimensional array p[k,i\ instead of p[i\ —but its purpose remains the same. 
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4.2 Soft Capacity Constraints 

Consider the case where the capacity of a route may be exceeded, subject to a linear penalty with 
coefficient a. This relaxation of the capacity constraints is useful in practical situations where 
the demand of a customer represents a time or workload quantity rather than a physical load 
in a truck, and where an excess may be acceptable. This relaxation is also useful in heuristics, 
allowing them to better explore the search space via intermediate infeasible solutions and adaptive 
penalties (Gendreau et al. 1994, Cordeau et al. 1997, Vidal et al. 2015a). 

All arcs (i, j) such that i < j are now included in A , and the cost of an arc (i, j) is 

c(ij) = rf 0 ,i+i + D[j\ - D[i + 1] + djfl + ax max{Q[j] - Q[i\ - Q, 0}. (8) 

Main algorithm. Algorithm 2 can still be applied subject to two changes: 

1. The dominates function is updated to account for the constraint relaxations: 


dominates (i^j) 


(p[i\ + d 0j t+i -D[i + 1] + a x (Q[j] - Q\i]) < p[j] + doj+i ~ D\j + 1] if i < j 
1 p[i\ + d 0i i+i - D[i + 1] < p[j] + doj+i ~ D[j + 1] if i > j. 


2. The rule for eliminating the front label in A, at line 11, becomes: 

while |A| > 1 and p[front] + /( front , t + 1) > p[front2] + /(/ron£2, t + 1). 


Correctness of the algorithm. We rely on the same principles as before. The cost f(i,x) of 
the extension of a node i to a node x G {i + 1,..., n} and the functions gi{x) are defined as: 


f{i,x) — p[i\ + c(i, x) (10) 

9i(x) = f(i , x) - D[x\ - d x o (11) 

= p[i\ + do,i+i - D[i + 1] + a x max{Q[x] - Q[i\ - Q , 0}. (12) 

Again, gi{x) < gi(x) means that j is dominated by i as a predecessor. Now, we define the 
function hi(q ) for q G R as: 

hi(g) = p[i] + do,i+i - + 1] + a x max{g - Q[i] - Q, 0}. (13) 

Note that hi(Q[x]) = giix). If hi(q) < hj(q ) for gGl, then j is dominated by i. 


The functions hi(q) are illustrated in Figure 3 for the numerical example of Section 3. 
Each function is piecewise linear and continuous with two pieces: a constant piece with value 
hi — p[i] + g?o,z+i — D[i + 1] for q < Q[i\ + Q and an increasing piece with slope a. 

We define two values that characterize the predecessor candidates: 

— the fixed cost hi — p[i] + do,z+i — D[i + 1], and 

— the cumulative demand l L — Q[i\. 

For i < j , Q[i\ < Q[j] and one can verify that {hi{q) < hj(q ) Mq G M} {hi+ax (lj — k) < hj}. 
For i > j, Q[i\ > Q[j] and one can verify that {hi(q) < hj(q) Vg G 1} {hi < hj}. These 
conditions lead to the dominance relation of Equation (9). We now show that the following loop 
invariant is respected at line 4 of the algorithm: 
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Figure 3: Numerical example: the functions hfix) are displayed for i G {0,..., 6}. 

Proposition 2 (Loop Invariant) A contains mutually nondominated predecessors oft ranked 
by increasing fixed cost: for i G {1,..., |A| - 1}, h Xi < h Xi + 1 , l Xi < l Xi + 1 , and h Xi + a(l Xj - l x .) > 
h Xj • Moreover, h Xl (Q[t ]) < h X2 (Q[t]). 

This proposition is true for t = 1 since A = (0). Then, any new node t is inserted at 
the back of A only if it is not dominated by the current back element, which implies that 
h x | A | + a(l Xt — Za| A j ) > h\ t • Then, the algorithm eliminates all dominated nodes i such that 
h Xt < h Xi until it finds the first node that satisfies the invariant condition. Finally, when t is 
incremented, any front node that does not satisfy the condition h Xl {Q[t\) < h X2 (Q[t]) is eliminated. 

Now we show that this invariant implies that the front node is a best predecessor at each 
iteration t. First, h Xl < h Xk and l Xl < l Xk for any k > 1, so there is no better predecessor in A. 
Second, any other predecessor i that does not appear in A has either been eliminated because it is 
dominated by another predecessor in A, or because it was the front element at an iteration t' < t 
and the last condition of Proposition 2 applied. In this specific case, for the second element j we 
have li < lj and hj(Q[t']) < hi(Q[t']). Because of the shape of the functions h, this also implies 
that hj(Q[t]) < hi(Q[t]) for any t > t' , so j is an equal or better predecessor. In both cases, the 
predecessor i has been eliminated from A only if a better candidate exists, and we have shown 
that the front element is a best predecessor in A. 

5 Computational Experiments 

The previous section has introduced a linear Split algorithm and its extensions to a limited fleet 
and soft capacity constraints. 

We now evaluate experimentally the speedup of the new approach compared to the classical 
Bellman-based algorithm of Section 2. We generated a set of 105 benchmark instances containing 
information on the giant tour, the distances between successive nodes, the distances from and to 
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the depot, and finally the demand for each node. Each instance is based on a data set from the 
TSPLib (http://comopt.ifi.uni-heidelberg.de/software/TSPLIB95/) or the World TSP 
(http://www.math.uwaterloo.ca/tsp/world/countries.html). We produced the giant tour 
using the Lin-Kernighan heuristic of Helsgaun (2000), and we selected as the depot the node that 
is the closest to the barycenter of the nodes. We generated the demand for each node randomly 
with a uniform probability in [1, 50]. These instances have between 29 and 71,009 nodes. For each 
instance, we considered ten vehicle capacities: Q G {10 2 , 2x 10 2 , 4x 10 2 ,10 3 , 2x 10 3 ,4x 10 3 ,10 4 , 2x 
10 4 ,4 x 10 4 , 10 5 }. Some (instance, capacity) pairs are eliminated from the set because the capacity 
of a single vehicle exceeds the total demand of the customers. 

We implemented the algorithms of this paper and the original Bellman-based Split in C++. 
We implemented the double-ended queue A as an array of size n with front and back pointers. 
Overall, these algorithms use simple data structures and elementary arithmetic, limiting possible 
bias related to programming style or implementation skills. The code is available at http: 
//wl.cirrelt.ca/~vidalt/en/VRP-resources.html. 

We ran the algorithms for each instance and capacity level on a Xeon 3.07 GHz CPU, using a 
single thread. The small and medium instances were solved quickly, and so we performed multiple 
runs in a loop to obtain accurate CPU time measurements. We calibrated the number of runs to 
achieve a CPU time of about 10 to 60 seconds per instance. 

Hard capacity constraints. Table 1 reports the CPU time, in milliseconds, of the Bellman- 
based Split algorithm and our new linear-time algorithm on a selection of twelve instances. The 
speedups for all 105 x 10 (instance, capacity) pairs are represented graphically in Figure 4, where 
each section of the figure represents a different capacity Q. 

Both algorithms appear to be reasonably fast in the presence of an unlimited fleet and hard 
capacity constraints. For both methods, the CPU time grows linearly as a function of n when the 
capacity Q is fixed, since B is also fixed. The time of the Bellman-based algorithm ranges from a 
fraction of milliseconds for small and medium instances with short routes, up to one second for an 
instance with 71, 009 nodes and Q = 100, 000. For the same instance, the linear split algorithm 
does not exceed three milliseconds. Split is used extensively in modern population-based heuristics 
for the VRP (e.g., between 10,000 and 50,000 times per run in Prins (2004) and Vidal et al. 
(2012)), so a small CPU time is essential. 

As illustrated in Figure 4, the overall speedup between the linear algorithm and the Bellman- 
based version grows linearly with the capacity Q, which is itself proportional to B. The break-even 
point in terms of route size—beyond which the linear algorithm is faster—is Q = 100, when the 
routes have an average of four customers. Therefore, the linear Split algorithm is beneficial for 
most VRP applications. For large instances and long routes, the benefits of the proposed Split 
algorithm are significant, with speedups greater than 300x. 

Note that the speedup as a function of n, for a fixed value of Q, is not exactly constant but 
instead slightly concave. This can be explained by a combination of effects. First, for small values 
of n, the inner loop of the Bellman-based algorithm is slightly faster because it is limited by 
the end of the giant tour. Second, the CPU time required for initialization and access of the 
arrays D[i\ and Q[i\ may not be exactly linear as a function of n, due to reduced efficiency of the 
memory cache on large problems. 
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Table 1: CPU time (ms) of Split: using Bellman or linear algorithm 


Inst 

n 

Q= 

^Bellman 

100 

^Linear 

Q= 

^Bellman 

200 

^Linear 

Q= 

^Bellman 

500 

^Linear 

Q=1,000 

^Bellman ^Linear 

Q=2,000 

^Bellman ^Linear 

wi29 

28 

1.12xlCT 3 

1.24xl0~ 3 

1.76x 10~ 3 

1.33xl0~ 3 

1.90x 10~ 3 

1.43xl0~ 3 

— 

— 

— — 

eil51 

50 

2.31 xlCT 3 

1.79xl0~ 3 

2.78x 10~ 3 

1.92xl0~ 3 

4.35x 10~ 3 

2.18xl0~ 3 

4.70x 10~ 3 

2.14xl0~ 3 

— — 

rdlOO 

99 

4.74x 10 -3 

3.44xl0“ 3 

5.88x 10 -3 

3.71 xl0“ 3 

9.30x 10 -3 

3.91 xl0“ 3 

1.40xl0“ 2 

3.88xl0 -3 

1.80xl0“ 2 3.97xl0“ 3 

dl98 

197 

8.00xl0“ 3 

6.18xl0 -3 

1.17xl0 -2 

6.26xl0 -3 

1.92xl0 -2 

6.26xl0 -3 

2.84xl0 -2 

5.92xl0 -3 

4.48xlO -2 5.84xl0 -3 

A417 

416 

1.55xl0~ 2 

1.24xl(T 2 

2.31 xKT 2 

1.30xl0~ 2 

4.04 xl0~ 2 

1.40xl0~ 2 

7.04 xl0~ 2 

1.43xl0~ 2 

1.15xl0 _1 1.41 xl0~ 2 

prl002 

1001 

4.09x 10~ 2 

3.40xl0~ 2 

6.08xl0~ 2 

3.38xl0~ 2 

9.75xl0~ 2 

3.37xl0~ 2 

1.73X10- 1 

3.51 xl0~ 2 

3.02x 10 _1 3.37xl0~ 2 

mul979 

1978 

8.43x 10~ 2 

6.84xl0~ 2 

1.17x 10 _1 

6.90xl0~ 2 

1.98X10- 1 

6.69xl0~ 2 

3.41 xlO^ 1 

6.78xl0~ 2 

6.14x 10 _1 6.96xl0~ 2 

fnl4461 

4460 

1.83x 10 _1 

1.55xl0 _1 

2.68x 10 _1 

1.60xl0 _1 

4.54x 10 _1 

1.62xl0 _1 

7.65x 10 _1 

1.59xl0 _1 

1.41 1.57xl0 _1 

kz9976 

9975 

4.25xl0 _1 

3.67xl0 _1 

6.14xl0 _1 

3.75xl0 _1 

1.02 

3.70xl0 _1 

1.62 

3.63xl0 _1 

2.84 3.41xl0 _1 

dl8512 

18511 

7.99xl0 _1 

6.76xl0 _1 

1.15 

7.14xl0 _1 

1.92 
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Figure 4: Speedups of the linear Split over the Bellman-based algorithm for all 105 instances. 
The sections of the graph correspond to different values of Q. In each section, the x-axis indicates 
the number of nodes n in each instance. A logarithmic scale is used for both axes. 
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Figure 5: Speedups for the case with a limited fleet. 


Limited Fleet. Figure 5 presents results in the same format for the Split algorithm with a 
limited fleet. In these experiments, the maximum fleet value m is set to the optimal number of 
vehicles obtained from the unlimited Split algorithm. The algorithm with a limited fleet returns 
the best solution for any number of routes k < m. The conclusions are similar to those of the 
previous case, with speedups ranging from lx to 447x. The CPU times are a factor of m higher 
than in the previous case for all instances. The largest CPU time for both algorithms, around 
58 seconds, occurs for the largest instance with Q = 100, a regime with short routes but many 
vehicles, where the Bellman-based and linear Split algorithms perform equally. To further reduce 
the CPU time in these cases, one could rely on advanced algorithms for minimum weight fc-link 
paths on graphs with the Monge property (Aggarwal et al. 1994), or explore a heuristic Split 
based on a Lagrangian relaxation of the fleet-size limit. 

Soft capacity constraints. Finally, Figure 6 displays the speedups for the Split algorithm 
with an unlimited fleet and soft capacity constraints. Allowing an unlimited excess capacity puts 
the Bellman-based algorithm at a disadvantage, since the set list of candidate predecessors cannot 
be reduced by feasibility checks, leading to a complexity of 0{n 2 ) instead of 0{nB). To avoid 
this issue, we can limit the total capacity to a reasonable value. We considered Q' = 4Q, which 
reduces the CPU time of the Bellman-based Split algorithm while retaining its purpose, and we 
display both sets of results. The black dots indicate the results for the unlimited case, and the 
gray dots indicate the results for the limited case. 

The 0(n 2 ) growth of the Bellman-based Split in the unlimited case can be observed on the 
figure: it leads to a linear growth of the speedup as a function of n, up to 7187x for the largest 
instance. The speedups are smaller but still significant when the comparison is with the Bellman 
algorithm with the 4 Q bound: from 2x to 1800x. The maximum CPU time of the linear Split 
algorithm, for the largest problem instance, is 4.37 milliseconds, compared to 6.3 and 16.6 seconds 
for the Bellman-based algorithms. 
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Figure 6: Speedups for soft capacity constraints. Two sets of results are presented: the speedups 
relative to the Bellman algorithm with no limit on the excess capacity (black dots), and those 
relative to the Bellman algorithm with a limit of 4 Q on the total demand of a route (gray dots). 

6 Conclusions 

In this article, we have introduced a simple and efficient Split algorithm in 0(n). The algorithm 
uses dominance properties and can be extended to deal with a limited number of vehicles or 
relaxed capacity constraints. Our computational experiments show that the new algorithm is 
significantly faster than the usual Bellman-based approach on VRPs of a realistic size. Positive 
speedups are encountered when the number of deliveries per route is greater than four. For large 
problems with 70,000 deliveries and few routes, the speedup reaches 400x. 

There are multiple opportunities for future research. One may consider a more systematic 
use of the Split algorithm in heuristic searches, either by further exploring the space of the giant 
tours (Prins 2004, 2009) via restricted local searches, or by using the linear Split algorithm as an 
implicit route-evaluation procedure (Vidal 2015, Vidal et al. 2015b) for multi-attribute VRPs 
with multiple trips per vehicle, intermediate facilities, or recharging stations. 

Furthermore, the same predecessor-filtering techniques may be useful for some nonpolynomial 
versions of the Split algorithm, e.g., for location-routing problems or VRPs with a heterogeneous 
fleet (Duhamel et al. 2011). 

Finally, many multi-attribute vehicle routing and scheduling problems, with additional con¬ 
straints, decision sets, and objectives can be modeled via resources, resource constraints, and 
their extension functions on the routes (Desaulniers et al. 1998, Irnich 2008). Based on this 
formalism, it would be profitable to take a step back and consider current algorithms in a more 
general perspective, identifying which properties of the extension functions allow for more efficient 
Split algorithms and other neighborhood-evaluation procedures. This is an important task, as 
the success of modern vehicle routing metaheuristics is, for a large part, conditioned by the 
computational complexity of their simplest building blocks. 
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